Add Genesis schemas and fix doc deployment #50
New domain schemas covering DOE Genesis challenges:
- Energy: grid, subsurface_energy
- Nuclear: fusion, nuclear_cleanup, nuclear_forensics
- Earth Science: subsurface, hydrogeology, remote_sensing, water_resources
- Computing: data_centers, microelectronics, quantum
- Industry: manufacturing, construction, unconventional_resources
- Materials/Physics: computational_materials, particle_physics
- Bioprocessing/Labs: biomanufacturing, autonomous_labs

Also updates the main valuesets.yaml imports.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Generated Python rich enums, datamodel, and merged schema for the 19 new Genesis Mission schema modules.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
deploy-docs: Run gen-project before gen-doc so element pages for all schema modules (including new ones) are generated before mkdocs builds the site. This fixes broken links on the live site.

regen-derived: Add gen-sssom step to regenerate SSSOM ontology mapping files alongside project and doc files.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
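The broken links this commit addresses could be caught before deployment with a small local check. The following is a minimal sketch, not part of the PR: the function name `find_broken_md_links` and the idea of pointing it at a `docs/` directory are assumptions for illustration, and it only handles inline Markdown links to `.md` targets.

```python
import re
from pathlib import Path

# Inline Markdown links of the form [text](target); image links are skipped.
MD_LINK = re.compile(r"(?<!!)\[[^\]]*\]\(([^)\s]+)\)")


def find_broken_md_links(docs_dir: Path) -> list[tuple[Path, str]]:
    """Return (page, target) pairs where a relative .md link points at a missing file."""
    broken = []
    for page in sorted(docs_dir.rglob("*.md")):
        for target in MD_LINK.findall(page.read_text(encoding="utf-8")):
            path_part = target.split("#", 1)[0]  # drop any #anchor suffix
            # Skip external URLs, same-page anchors, and non-Markdown targets.
            if "://" in target or not path_part.endswith(".md"):
                continue
            if not (page.parent / path_part).exists():
                broken.append((page, target))
    return broken


if __name__ == "__main__":
    # "docs" is a hypothetical location for the generated Markdown pages.
    for page, target in find_broken_md_links(Path("docs")):
        print(f"{page}: broken link to {target}")
```

Run after the doc generation step, a check like this would report a page such as genesis-mission.md linking to an element page that was never generated.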
Claude finished @cmungall's task

PR Review Analysis

I've completed my review of this large merged PR that adds 19 Genesis Mission schema modules and fixes documentation deployment. Here's my comprehensive feedback:

🎯 Overall Assessment: Good Quality Implementation

This is a well-structured addition that follows project conventions and successfully addresses the broken link issue.

✅ Strengths

Code Quality & Best Practices
Generated Code Quality
Workflow Improvements
Pull request overview
This pull request adds 19 Genesis Mission schema modules covering DOE energy/grid, nuclear, earth science, computing, industry, materials/physics, and bioprocessing/labs domains. It also fixes the documentation deployment workflow and enhances the regen-derived workflow.
Changes:
- Adds 19 new schema YAML files with corresponding generated Python enums and datamodel
- Fixes deploy-docs workflow to run gen-project before gen-doc, ensuring element pages are generated
- Adds gen-sssom to regen-derived workflow for regenerating SSSOM ontology mapping files
Reviewed changes
Copilot reviewed 43 out of 46 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| src/valuesets/schema/valuesets.yaml | Adds imports for all 19 new schema modules |
| src/valuesets/schema/physics/particle_physics.yaml | Particle accelerators and fundamental particles schema |
| src/valuesets/schema/materials_science/computational_materials.yaml | Computational materials methods and property prediction schema |
| src/valuesets/schema/lab_automation/autonomous_labs.yaml | AI-driven autonomous laboratories schema |
| src/valuesets/schema/industry/unconventional_resources.yaml | Unconventional mineral resources and recovery methods schema |
| src/valuesets/schema/industry/manufacturing.yaml | Advanced manufacturing processes schema |
| src/valuesets/schema/industry/construction.yaml | Building systems and construction schema |
| src/valuesets/schema/energy/subsurface_energy.yaml | Subsurface energy resources and storage schema |
| src/valuesets/schema/energy/nuclear/nuclear_forensics.yaml | Nuclear forensics and attribution schema |
| src/valuesets/schema/energy/nuclear/nuclear_cleanup.yaml | Nuclear site cleanup and restoration schema |
| src/valuesets/schema/energy/nuclear/fusion.yaml | Fusion energy concepts and methods schema |
| src/valuesets/schema/energy/grid.yaml | Electrical grid components and management schema |
| src/valuesets/schema/earth_science/water_resources.yaml | Water resource types and quality (missing imports section) |
| src/valuesets/schema/earth_science/subsurface.yaml | Subsurface characterization and geophysical methods (missing imports section) |
| src/valuesets/schema/earth_science/remote_sensing.yaml | Remote sensing platforms and data types (missing imports section) |
| src/valuesets/schema/earth_science/hydrogeology.yaml | Aquifer types and groundwater processes (missing imports section) |
| src/valuesets/schema/computing/quantum.yaml | Quantum computing concepts schema |
| src/valuesets/schema/computing/microelectronics.yaml | Semiconductor and microelectronics schema |
| src/valuesets/schema/computing/data_centers.yaml | Data center infrastructure schema |
| src/valuesets/schema/bioprocessing/biomanufacturing.yaml | Biomanufacturing scales and processes schema |
| src/valuesets/enums/* | Generated Python enums from schemas |
| src/valuesets/enums/__init__.py | Updated with all new enum imports |
| src/valuesets/enums/mining_processing.py | Enhanced ExtractableTargetElement with detailed descriptions and ontology mappings |
| .github/workflows/deploy-docs.yaml | Runs gen-project before gen-doc and adds --force flag to deployment |
| .github/workflows/regen-derived.yaml | Adds gen-sssom command to regenerate SSSOM files |
description: >-
  Value sets for water resource types, water use categories, and water quality
  relevant to the DOE 'Predicting U.S. Water for Energy' challenge
id: https://w3id.org/valuesets/earth_science/water_resources
The imports: section is missing from this schema file. All LinkML schema files in this codebase should have an imports: section that includes at least - linkml:types. This is consistent with all other schema files in the repository and is required for proper schema processing. Add the missing imports section after the id field and before the prefixes section.
description: >-
  Value sets for subsurface characterization, geophysical methods, and formation types
  relevant to the DOE GEO-AI lighthouse and subsurface energy programs
id: https://w3id.org/valuesets/earth_science/subsurface
The imports: section is missing from this schema file. All LinkML schema files in this codebase should have an imports: section that includes at least - linkml:types. This is consistent with all other schema files in the repository and is required for proper schema processing. Add the missing imports section after the id field and before the prefixes section.
title: Hydrogeology Value Sets
description: >-
  Value sets for hydrogeological characterization, aquifer types, and groundwater processes
id: https://w3id.org/valuesets/earth_science/hydrogeology
The imports: section is missing from this schema file. All LinkML schema files in this codebase should have an imports: section that includes at least - linkml:types. This is consistent with all other schema files in the repository and is required for proper schema processing. Add the missing imports section after the id field and before the prefixes section.
description: >-
  Value sets for remote sensing data types and platforms used in earth observation
  and subsurface characterization
id: https://w3id.org/valuesets/earth_science/remote_sensing
The imports: section is missing from this schema file. All LinkML schema files in this codebase should have an imports: section that includes at least - linkml:types. This is consistent with all other schema files in the repository and is required for proper schema processing. Add the missing imports section after the id field and before the prefixes section.
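The four review comments above describe the same gap, so it can be checked across the whole schema tree at once. The sketch below is an illustration only and not part of the PR: the helper names are hypothetical, the `src/valuesets/schema` path is taken from the file table above, and the regex handles only block-style YAML lists (not inline `imports: [linkml:types]`), so a real check would be better served by a proper YAML parser.

```python
import re
from pathlib import Path

# A top-level `imports:` key followed by one or more block-style list items.
IMPORTS_BLOCK = re.compile(
    r"^imports:[ \t]*\n((?:[ \t]*-[ \t]+.+(?:\n|$))+)", re.MULTILINE
)


def has_linkml_types_import(schema_text: str) -> bool:
    """True if the schema text has a top-level imports list containing linkml:types."""
    match = IMPORTS_BLOCK.search(schema_text)
    if match is None:
        return False
    # Turn "  - linkml:types" into "linkml:types", tolerating optional quotes.
    entries = [
        line.strip()[1:].strip().strip("'\"")
        for line in match.group(1).splitlines()
    ]
    return "linkml:types" in entries


def find_schemas_missing_imports(schema_root: Path) -> list[Path]:
    """List schema YAML files under schema_root that lack the linkml:types import."""
    return sorted(
        path
        for path in schema_root.rglob("*.yaml")
        if not has_linkml_types_import(path.read_text(encoding="utf-8"))
    )


if __name__ == "__main__":
    for path in find_schemas_missing_imports(Path("src/valuesets/schema")):
        print(f"missing 'imports: - linkml:types': {path}")
```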
Summary
Three commits:
1. Add 19 Genesis Mission schema modules - The YAML source schemas for energy/grid, nuclear, earth science, computing, industry, materials/physics, and bioprocessing/labs domains. Also updates valuesets.yaml imports.
2. Add generated enums and datamodel - Rich Python enums, datamodel, and merged schema generated from the new schemas.
3. Fix deploy-docs and enhance regen-derived workflows:
   - deploy-docs.yaml: Run gen-project before gen-doc so element pages for all schema modules are generated before mkdocs builds. This fixes the broken links on the genesis-mission overview page (element pages were missing because gen-doc alone doesn't process new schema modules into the merged schema).
   - regen-derived.yaml: Add gen-sssom to regenerate SSSOM ontology mapping files.

Root cause of broken links

The genesis-mission.md page linked to element pages like AquiferType.md, but those element pages were never generated in CI because CI ran gen-doc (not gen-project), so the merged schema didn't include new modules.

Test plan
🤖 Generated with Claude Code